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mRNA, with known or unknown sequences, to its translated protein to form a cognate pair. The cognate pair is selected based upon 
desired properties of the protein or the nucleic acid. This method also includes the evolution of a desired protein or nucleic acid 
molecule by amplifying the nucleic acid portion of the selected cognate pair, introducing variation into the nucleic acid, translating 
the nucleic acid, attaching the nucleic acid to its protein to form a second cognate pair, and re-selecting this cognate pair based upon 
desircd properties. 
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IN VITRO EVOLUTION OF NUCLEIC ACIDS AND ENCODED POLYPEPTIDE 

Backsjound of the Invention 

Field of the Invention 

5 The present invention relates to compositions and methods for the selection of 

nucleic acids and polypeptides. 
Description of the Related Art 

Ligand-receptor interactions are of interest for many reasons, from elucidating 
basic biological site recognition mechanisms to drug screening and rational drag design. It 
10 has been possible for many years to drive in vitro evolution of nucleic acids by selecting 

molecules out of large populations that preferentially bind to a selected target, then 
amplifying and mutating them for subsequent re-selection (Tuerk and Gold, Science 
249:505 (1990)). 

The ability to perform the selection process with proteins would be extremely 

15 usefid. This would permit in vitro design and productipn of proteins that bind specifically 

to chosen ligands. The use of proteins, as compared to nucleic acids, is particularly 
advantageous because the twenty diverse amino acid side chains in proteins have far more 
binding possibilities than the four similar chains in nucleic acid side. Further, many 
biologically and medically relevant ligands bind proteins. 

20 Both nucleic acid and protein evolution methods require access to a large and highly 

varied population of test molecules, a way to select members of the population that exhibit 
the desired properties, and the ability to reproduce the selected molecules with mutated 
variations to obtain another large population for subsequent selection. 

Prior attempts to develop a protein evolution method were primarily limited by the 

25 inability of the proteins to reproduce themselves and the inability to linic a polypeptide's 

encoding mRNA with the translated product. Additionally, the generation of large peptide 
libraries and scre^odng metbods have, until recently, required that the process have an in 
vivo expression step. Examples incliide yeast two- or three-hybrid, yeast display and phage 
display methods (Fields and Song, Nature 340:245 (1989); Licitra and Liu, PNAS 

30 93:12817 (1996); Boder and Wittrup, Nat Biotechnol 15:553 (1997); and Scott and Smith, 

Science 249:386 (1990)). In vivo methods suffer from various disadvantages, including a 
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limited library size and cumbersome screening steps. Additionally, midesired selective 
pressures can be placed on the generation of variants by cellular constraints of the host. 

In vitro methods have been developed more recratly, using prokaryotic and 
eukaryotic in vitro translation systems, such as ribosome display (Mattheakis et al., PNAS 
'91:9022 (1994); Hanes and Pltickthun, PNAS 94:4937 (1997); Jermutus et al.. Current 
Opinion in Biotechnology 9:534 (1998)). These methods link the protein and its encoding 
mRNA with the ribosome and the entire complex is screened against a ligand of choice. 
Potential disadvantages of this method include the large size of the ribosome, which could 
interfere with the screening of the attached, and relatively tiny, protein. • 

In 1997, two groups of workers developed an in vitro method of attaching a protein 
to its coding sequence during translation by using the ribosomal peptidyl transferase with 
puromycin attached to a linker DNA (Szostak et aL, International Patent Publication WO 
98/31700; Roberts and Szostak PNAS 94:12297 (1997); Nemoto et al., FEBS Letters 
414:405 (1997), all herein incorporated by reference). Once the coding sequence and 
peptides are linked, the peptides are exposed to a selected ligand. Selection or binding of 
the peptide by the ligand also selects the attached coding sequence, which can then be 
reproduced by standard means. Both Roberts and Szostak and Nemoto et al. used the 
technique of attaching a puromycin molecule to the 3' end of a coding sequience by a DNA 
linker or other non-translatable chain. Puromycin is a tKNA acceptor stem analog which 
accepts the nascent peptide chain under the action of the ribosomal peptidyl transferase and 
binds it stably and irreversibly, thereby halting translation. These methods suffer firom 
certain disadvantages. For example, tihe coding sequence encoding each peptide must be 
known and be modified both initially and between each selection. Thus, the methods of 
Roberts and Nemoto cannot be used to select native unknown mRNAs. Further, the 
modification of the codiiig sequence adds several steps to the process. Finally, the attached 
puromycin on the linker molecules may compete in the translation reaction with the native 
tRNAs for the A site on the ribosome reading its coding sequence or a nearby ribosome, 
and could thus 'poison" the translation process, just as would unattached puromycin in the 
translation reaction solution. Inadvertent interactions between puromycin and ribosomes 
could result in two kinds of reaction non-specificity: prematurely shortened proteins and 
proteins attached to the wrong message. There are reports in the prior art that indicate that 
the avidity of the A site and the peptidyl transferase for the puromycin may be modulated 
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by Mg^ concentration (Roberts, Curr. Opin. Chem. Biol. 3:268 (1999), herein incorporated 
by reference). Although Mg^ concentration may be titrated to control for the first kind of 
non-specificity, i.e. premature termination of translation, it will not affect the second type, 
i.e. inaccurate mRNA-protein linkage. 
5 Thus, a need exists for an in vitro nucleic acid-based protein evolution system 

which does not necessarily require initial knowledge of the nucleic acid's sequence or 
repeated chemical modification of the nucleic acids, and which can accurately link a 
mRNA to its protein. 

Summary of the Invention 

10 The present invention provides compositions and methods to select and evolve 

desired properties of proteins and nucleic acids. In various embodiments, the cmrent 
invention provides modified tEiNA's and tRNA analogs. Other embodiments include 
methods for generating polypeptides, assays enabling selection of individual members of 
the population of polypeptides having desired characteristics, methods for amplifying the 

15 nucleic acids encoding such selected polypeptides, and methods for generating new variants 

to screen for enhanced properties. 

In several embodiments, the present invention permits the attachment of a protein to 
its message without requiring modification of native mKNA, although modified mKNTA 
may still be used. The specificity of the methods embodied in various aspects of the 

20 cmrent invention are determined by the specificity of the codon-anticodon interaction. 

In a preferred embodiment, the invention permits the selection of nucleic acids by 
selecting the proteins for which they code. This may be accompHshed by connecting the 
protein to its cognate mRNA at the end of translation, which in turn is done by connecting 
both the protein and mRNA to a tRNA or tRNA analog. 

25 A preferred embodiment of the invention includes a tRNA molecule capable of 

covalently linking a nucleic acid encoding a polypeptide and the polypeptide to the tE(NA, 
wherein the linkage of the nucleic acid occurs on a portion of the tRNA other than the 
linkage of the polypeptide, the tRNA comprising a linking molecule associated with the 
anticodon of the tRNA. Preferably, an amino acid or amino acid analog is attached to the 3' 

30 end of a tRNA molecule by a stable bond to gmerate a stable aminoacyl tElNA aaoalog 

(SATA). 



-3- 



wo 01/90414 



PCTAJSOl/16210 



Other embodiments include a mKNA comprising a psoralen, preferably located in 
the 3' region of the reading frame, more preferably at the most 3' codon of the reading 
frame, most preferably at the 3' stop codon of the reading frame. In preferred 
embodiments, linkage between the tRNA and the mKNA is a cross-linked psoralen 
molecule, more preferably a furan-sided psoralen monoadduct. 

Several embodiments of the present invention include a method of stably linking a 
nucleic acid, a tRNA, and a polypeptide encoded by the mKNA together to form a linked 
mRNA-polypeptide complex. In a preferred embodiment, the nucleic acid is an roRNA, 
The method can further comprise providing a plurality of distinct nucleic acid-polypeptide 
complexes, providing a ligand with a desired binding characteristic, contacting the 
complexes with the hgand, removing unbound complexes, and recovering complexes 
bound to the ligand. 

Several methods of the current invention involve the evolution of nucleic acid 
molecules and/or proteins. In one embodiment, this invention comprises amplifying the 
nucleic acid component of the recovered complexes and introducing variation to the 
sequence of the nucleic acids. In other embodiments, the method further comprises 
translating polypeptides from the amplified and varied nucleic acids, linking them together 
using tRNA, and contacting th^i with the ligand to select another new population of bound 
complexes. Several embodiments of the present invention use selected protein-mRNA 
complexes in a process of in vitro evolution, especially the iterative process in which the 
selected mKNA is reproduced with variation, translated and again connected to cognate 
protein for selection. 

Brief Description of the Drawings 

Figure 1 illustrates schematically one example of the complex formed by the 
mRNA and its protein product when linked by a modified tRNA or analog. As shown, a 
codon of the mRNA pairs with the anticodon of a modified tE(NA and is covalently 
crossLinked to a psoralen monoadduct by UV irradiation. The translated polypeptide is 
linked to the modified tRNA via the ribosomal peptidyl transferase. Both linkages occur 
while the ixiRNA and nascent protein are held in place by the ribosome. 

Figure 2 illustrates schraiatically an example of the in vitro selection and evolution 
process, wherein the starting nucleic acids and their protein products are linked (e.g., 
according to Figure 1) and are selected by a particular characteristic exhibited by the 
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protein. Proteins not exhibiting the particular characteristic are discarded and those having 
the characteristic are amplified with variation, preferably via amplification with variation of 
the roRNA, to form a new population. In various embodiments, nonbinding proteins will 
be selected. The new population is translated and linked via a modified tRNA or analog, 
and the selection process is repeated. As many selection and amplification/mutation rounds 
as desired can be performed to optimize the protein product. 

Figure 3 illustrates one method of construction of a tRNA molecule of the 
invention. In this embodiment, the 5' end of a tRNA, a nucleic acid encoding an anticodon 
loop and having a molecule enable of stably linking to mRNA (here, psoralen), and the 3' 
end of tRNA modified with a terminal puromycin molecule are hgated to form a complete 
modified tRNA for use in the in vitro evolution methods of the invention. 

Figure 4 describes two alternative embodiments by which the crosslinking molecule 
psoralen can be positioned to be capable of Unking tiie mRNA with tiie tRNA in the 
methods of the invention. A first embodiment includes linking the psoralen monoadduct to 
the mRNA, and a second embodiment includes linking the psoralen to the anticodon of the 
tRNA. Psoraleii can either be monoadducted to the anticodon or the 3* terminal codon of 
the reading firame for known or partially known messages. This can be done in a separate 
procedure firom translation, i.e. before translation occurs. 

Detailed Description of the Preferred Embodiment 
Various aspects of the present invention use a tRNA mechanism that links 
messenger RNA (mRNA) to its translated protein product, forming a ''cognate pair." In 
several embodiments, mRNAs whose sequence is not known can be expressed, its protein 
charact^ized through a selection process against a Ugand with desired or selected 
properties, and nucleic acid evolution — resulting in protein evolution — can be performed in 
vitro to arrive at moleciiles with enhanced properties. The cognate pairs are preferably 
attached via a linking tRNA, modified tRNA, or tRNA analog. Li a preferred embodimCTit, 
the tRNA is connected to the nascent peptide by the ribosomal peptidyl transferase and to 
the mRNA through an ultraviolet induced cross link between the anticodon of the tRNA or 
tRNA analog and the codon of the RNA message. This can be done by, for example 
thiouracil, but in a preferred method, the linker is a psoralen crosslink made fi-om a 
psoralen monoadduct pre-placed on either the mRNA or preferably on the tRNA anticodon 

-5- 



wo 01/90414 



PCT/USOl/16210 



of choice. Preferably, a tRNA stop anticodon is selected. A stop codon/anticodon pair 
selects for fiill length transcripts. One skilled in the art will understand lhat a mKNA not 
having a stop codon may also be used and that any codon or nucleic acid triplet may be 
used. A tRNA having an anticodon which is not naturally occurring can be synthesized 
according to methods known in the art (e.g. Figure 3). 

The terms **protein/* **peptide," and '^polypeptide" are defined herein to mean a 
polymeric molecule of two or more units comprised of amino acids in any form (e.g., D- or 
L- amino acids, synthetic or modified amino acids capable of polymerizing via peptide 
bonds, etc.), and these terms may be used interchangeably herein. 

The term "pseudo stop codon" is defined herein to mean a codon which, while not 
naturally a nonsense codon, prevents a message fix>m being further translated. A pseudo 
stop codon may be created by using a "stable aminoacyl tRNA analog" or SATA, as 
described below. In this manner, a pseudo stop codon is a codon which is recognized by 
and binds to a SATA. Another method by which to create a pseudo stop codon is to create 
an artificial system in which the necessary tRNA having an anticodon complementary to 
the pseudocodon is substantially depleted. Accordingly, translation will stop when the 
absent tRNA is required, i.e. at the pseudo stop codon. One skilled in the art will 
appreciate that are numerous ways to create a pseudo stop codon as defined herein. 

The formation of comections between mRNA and its protein product generally 
requires a tRNA or tRNA analog with certain characteristics. In several embodiments of 
the current invention, the tRNA or tRNA analog Avill have a stable peptide acceptor. This 
modification changes the tRNA or tRNA analog such that after it accepts the nascent 
peptide chain by the action of the ribosomal peptidyl transferase, it holds the chain in a 
stable manner such that the peptidyl transferase cannot detach it. This roay be 
accomplished by using a bond such as a 2' ester on a 3' deoxy adenosine or an amino "acyl 
tRNA^.^" which can bind to the ribosome, accept the peptide chain, and then not act as a 
donor in the next transpeptidation (ChinaU et al., Biochem. 13:3001 (1974); Krayevsky 
and Kukhanova, Prog. Nuc. Acid Res 23:1 (1979) and Sprinzl and Cramer Prog. Nuc. Acid 
Res 22: 1(1979)). 

In a preferred embodiment, an amino acid or amino acid analog is attached to the 3' 
end of the tRNA or tRNA analog by a stable bond. This stable bond contrasts the labile, 
high energy ester bond that connects these two in the native structure. The stable bond not 
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only protects the bond from the action of the peptidyl transferase, but also preserves the 
structure during subsequent steps. For convenience, this modified tRNA or tllNA analog 
will be referred to as a "stable aminoacyl tRNA analog" or SATA. As used herein, a SATA 
is an entity which can recognize a selected codon such Hxat it can accept a peptide chain by 
the action of the ribosomal peptidyl transferase when the cognate codon is in the reading 
position of the ribosome. The peptide chain will be bound in such a way that the peptide is 
bound stably and cannot be unattached by the peptidyl transferase. Preferably, the selected 
codon is recognized by hydrogen bonding. 

One method for creating a SATA was published in 1973 (Fraser and Rich, PNAS 
70:2671 (1973)). This method involves the conversion of a tRNA, or tRNA analog, to a 3'- 
amino-3*-deoxy tRNA. This is accomplished by adding a 3'-amino-3'-deoxy adenosine to 
the end of a native tRNA with tRNA nucleotidyl transferase after removing the native 
adenosine from it with snake venom phosphodiesterase. This modified tRNA is then 
charged with an amino acid by the respective aminoacyl tRNA synthetase (aaRS). Fraser 
and Rich used an aaRS in which the tRNA is charged on the 3', rather than the 2\ hydroxyl. 
The anuno acid is bound to the tRNA by a stable amide bond rather than the usual labile 
high-energy ester bond. Thus, when it accepts a peptide from ribosomal peptidyl 
transferase it will stably hold the peptide and not be able to donate it to another acceptor. 

hi a preferred method, the SATA will be attached to the translated message by a 
psoralen cross link between the codon and anticodon. Psoralen cross links are 
preferentially made between sequences that contain complementary 5' pyrimidine-purine 3' 
sequences especially UA or TA sequences (Cimino et al., Ann. Rev. Biochem. 54:1151 
(1985)). The codon coding for the SATA, or the linking codon, can be PYR-PUR-X or X- 
PYR-PUR, so that several codons may be used for the linking codon. Conveniently, the 
stop or nonsense codons have this configuration. Using a codon that codes for an amino 
acid may require minor adjustments to the genetic code, which could complicate some 
applications. Therefore, in a preferred embodiment, a stop codon is used as the linking 
codon and the SATA functions as a nonsense suppressor in that it recognizes the linking 
codon. One skilled in the art, however, will appreciate that, with appropriate adjustments to 
the system, any codon can be used. 

Fraser and Rich did their work in E. coli, but the most effective in vitro translation 
systems are in eukaryotes The use of prokaryotic suppressors in eukaiyotic translation 
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systems appears to be feasible (Geller and Rich Nature 283:41 (1980); Edwards et al PNAS 
88:1153 (1991); Hou and Schimmel Biochem 28:6800 (1989)). They are primarily limited 
by tihie resident aaRS's. This limitation is overcome by various embodiments of the preset 
invCTtion because the tRNA or analog can be charged in the prokaryotic system and then 
purified according to established methods (Lucas-Lenard and Haemii, PNAS 63:93 (1969)). 

In several embodiments of the current invention, acceptor stem modifications 
suitable for use in the fRNAs and analogs can be produced by various methods known in 
the art. Such methods are found in, for example, Sprinzl and Cramer, Prog. Nuc. Acid Res. 
22:1 (1979). In an altemative embodiment, "transcriptional tRNA", i.e. the sequence of the 
tRNA as it would be transcribed rather than after the post-transcriptional processing, leads 
to the atypical and modified bases that are common in tRNAs. These transcriptional 
tRNAs are capable of functioning as tElNAs (Dabrowski et al., EMBO J. 14: 4872, 1995; 
and Harrington et al., Biocherh. 32: 7617, 1993). Transcriptional tRNA can be produced by 
transcription or can be made by connecting commercial RNA sequences (such as those 
available from Dharmacon Research Inc., Boulder, CO) together, piece-wise as in Figure 3, 
or in some combination of established methods. For instance, the S' phosphate and 3' 
puromycin are commercially available attached to oligoribonucleotides. These pieces can 
be coimected together using T4 DNA ligase, as is well-known in the art (Moore and Sharp, 
Science 256: 992, 1992). Alternatively, in a preferred embodiment, T4 RNA ligase is used 
(Romaniuk and Uhlenbeck, Methods in Enzymology 100:52 (1983)). 

In several embodiments of the present invention, psoralen is monoadducted to the 
SATA by construction of a tRNA from pieces including a psoralen linked oUgonucleotide 
(Fig. 3) or by monoadduction to a native or modified tRNA or analog (Fig. 4). 

In several embodiments, translation will stop when tiie nascent protein is attached to 
the SATA by the peptidyl transferase. When a large number of ribosomes are in this 
position the SATA and the mRNA will be coimected with UV Ught. In a preferred method 
this will be accomplished by having a psoralen crosslink formed. Psoralens have a fijoran 
side and a pyrone side, and they readily intercalate between complementary base pairs in 
double stranded DNA, RNA, and DNA-RNA hybrids (Cimino et al., Ann. Rev. Biochem. 
54:1 151 (1985)). Upon irradiation with UV, preferably in the range of 320 nm to 400 nm, 
cross linking will take place and leave the staggered pyrimidines covalently bound. By 
either forming crosslinks and photo reversing them or by using selected waveloagths, it is 
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possible to fonn monoadducts, described more fully below. These will be either pjorone 
sided or fiiran sided monoadducts. Upon further irradiatioii, the furan sided monoadducts 
can be covalently orosslinked to complementary base pairs. The pyrone sided monoadducts 
cannot be further crosslinked. The formation of the furan sided psoralen monoadduct 
(MAf) is also done according to established methods. In a preferred method, the psoralen 
is attached to the anticodon of the SATA. However, psoralen can also be attached at the 
end of the reading frame of the message, as depicted in Figure 4. 

Methods for large scale production of purified MAf on oligonucleotides are 
described in the literature (e.g., Speihnann et al., PNAS 89:4514, 1992), as are methods that 
require less resoiurces, but have some non-cross-linkable pyrone sided psoralen monoadduct 
contamination (e.g., U.S. Patent No. 4,599,303; Gamper et al., J. Mol. Biol. 197: 349 
(1987); Gamper et al., Photochem. Photobiol. 40:29 (1984)). In several embodiments of 
the current invention, psoralen labeling is accomplished by using either method. In a 
preferred embodiment, furan sided monoadducts will be created using visible light, 
preferably in the range of approximately 400 nm - 420 nm, according to the methods 
described m U.S. Patent No. 5,462,733 and Gasparro et al., Photochem. Photobiol. 57:1007 
(1993). In one aspect of this invention, a SATA with a fiiran sided monoadduct or 
monoadducted oligonucleotides for plac^ent on the 3* end of mRNAs, along with a 
nonadducted SATA are provided as the basis of a kit. 

Use of the SATA and the monoadduct in several embodiments of the current 
invention is particularly advantageous for in vitro translation systems. However, one 
skilled in the art will appreciate that in situ systems can also be used. Various 
embodiments of the current invention will be applicable to any in vitro translation system, 
including, but not limited to, rabbit reticulocyte lysate (KLL), wheat germ, E. coli, yeast 
lysate systems, etc. Many embodiments of the current invention are also well-suited for use 
in hybrid systems where components of different systems are combined. 

tiRNAs aminoacylated on a 3* amide bond are reported not to combine with the 
elongation factor EF-TU which assists in binding to the A site (Sprinzl and Cramer, Prog. 
Nuc. Acid Res. 22:1 (1979)). Such modified tRNAs do, however, bind to the A site. This 
binding of 3' modified tRNAs can be increased by changing the Mg** concentration 
(Chinali et al., Biochem. 13:3001 (1974)). The appropriate concentrations of and/or molar 
ratios of SATA and Mg"^ can be detemiined empirically. If the concentration or A site 
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avidity of SATA is too high, the SATA could compete with native tRNAs for non-cognate 
codons i.e., could function much like puromycin and stall translation. If the concentration 
or A site avidity of SATA is too low, the SATA might not effectively compete with the 
release factors, i.e., it would not act as an effective nonsense suppressor fRNA. The 
balance between these can be determined empirically. 

It is also believed that the elongation &ctor aids in proofireading the codon- 
anticodon recognition. The error rate in the absence of elongation factor and tiie associated 
GTP hydrolysis is estimated to be 1 in 100 for codons one nucleotide away (Voet and Voet, 
Biochemistry 2°^ ed. pp. 1000-1002 (1995), John Wiley and Sons). In a preferred 
embodiment, UAA is used as the linking codon. For UAA as the linking codon, there are 7 
non stop codons which differ by one amino acid. This is 7/61 or about 11.5% of the non 
stop codons. One can estimate the probability of miscoding a given codon as (0.01)(0.115) 
= 1.15 X 10'^ miscodes per codon. Thus, one would expect a miscode about every 870 
codons, a frequency which will not substantially impair performance of various methods of 
the current invention. In an alternative embodiment, UAG is used as the Unking codon. 

In several embodiments, appropriate concentrations of SATA and Mg** are used in 
the in vitro translation system, e.g. RRL, in the presence of the mRNA molecides in the 
pool, causing translation to cease when the libosome reaches the codon which pramits the 
SATA to accept the peptide chain (the linking codon described above). Within a short 
time, most of the linking codons will be occupied by SATAs within ribosomes. In a 
preferred embodiment, the system then will be irradiated with UV light, preferably at 
approximately 320 nm to 400 imi. Nucleic acids are typically transparent to, i.e. do not 
absorb, this wavelength range. Upon irradiation, the psoralen monoadduct will convert to a . 
crosslink connecting the anticodon and the codon by a stable covalent bond. 

In a preferred embodiment, the target mRNA is pre-selected. In another 
embodiment, the target mRNA is artificially produced. La an alternative embodiment, the 
target consists of messages native to the system under investigation, which may be 
unknown and/or unidentified. The ability to* use unknown and/or unidentified mRNAs is a 
particular advantage of several embodiments of the current invention. 

In several embodiments, once all the nascent proteins are connected to their cognate 
mRNAs, the ribosomes are released or denatured. Preferably, this is accompUshed by the 
depletion of Mg^ through dialysis, simple dilution, or chelation. One skilled in the art will 
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understand that other methods, mcluding, but not limited to, denaturation by changing the 
. ionic strength, the pH, or the solvent system can also be used. 

In several embodiments of the invention, the selection of cognate pairs will be based 
upon afSnity binding of proteins according to any of a variety of established methods, 
including, but not limited to, afGnity columns, immunoprecipitation, and many high 
throughput screening procedures. A variety of ligands may also be used, including, but not 
limited to, proteins, nucleic acids, chemical compounds, polymers and metals. In addition,, 
cell membranes or receptors, or even entire cells may be used to bind the cognate pairs. 
The selection can be positive or negative. That is, the selected cognate pairs can be those 
that do bind well to a ligand or those that do not. For instance, for a protein to accelerate a 
thermodynamically favorable reaction, i.e., act as an en2yme for that reaction, it should 
bind both the substrate and a transition state analog. However, the transition state analog 
should be bound much more tightly than the substrate. This is described by the equation 

^enzyme ^trans 
^(penzyme ^subst 

where the ratio of the rate of the reaction with the enzyme, kenzyme^ to the rate without, 
^(penzymey is equal to the ratio of the binding of the transition state to the enzyme Kfrans 
over the binding of the substrate to the enzyme K^ubst (Voet and Voet, Biochemistry T"^ 
ed. p.380, (1995), John Wiley and Sons). 

In a preferred embodiment, proteins which compete poorly for binding to the 
substrate but compete well for binding to the transition state analog are selected. 
Operationally, this may be accomphshed by taking the proteins that are easily eluted £rom a 
matrix with substrate or substrate analog boimd to it and are flie most difScult to remove 
from matrix with transition state analog bound to it. By sequentially repeating this 
selection and reproducing the p^roteins through repUcation and translation of the nucleic 
acid of the cognate pairs, an improved enzyme should evolve. AfOnity to one mtity and 
lack of affinity to another in the same selection process is used in several embodiments of 
the current invention. Selection can also be done by RNA in many embodiments. 

Once the selection has identified a population of cognate pairs it may be convenient 
to detach the mRNA strand from the SATA to reproduce it. This is not always necessary 
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but can be accomplished by irradiating the pairs with UV, preferably at approximately 313 
nm or just below. This has been identified as a wave length that will photoreverse the 



photoreversal to nucleic acid damage is estimated to be 1 photoreversal for danoage to 1 in 
600 bases (Cimino et al., Biochem 25:3013 (1986)). 

One skilled in the art will appreciate that the mBNAs can be reproduced in many 
ways, including, but not limited to, by RNA-depCTident RNA polymerases or by rev^e 
transcription and PGR. This can take place using mRNAs separated from the cognate pairs, 
e.g., using poly T or poly U to hybridize to the poly A tails of, for instance, native unknown 
messages or by leaving the cognate pairs intact and usiag oUgonucleotide primers that 
hybridize partially into the reading frame for known messages. Alternatively, commercial 
kits for rapid amplification of cDNA ends may be used. When this is used to evolve 
proteins and not just to select them, it would be preferable to sample at least one amino acid 
substitution at each position in the protein. 
The Replication Threshold 

A nominal minimum nimiber of repUcations for efficient evolution may be 
estimated using the following formulae. If there is a sequence which is n sequences in 
length, with a selective improvement r mutations away with a mutation rate of the 
probabiUty of generating the selective improvement on replication may be determined as 
follows: 

For r =1, probabihty of a mutation at the right point, p, times the probability that it 
mutated to the right one of the three nucleotides that are different from the starting point, 
i/5, times the probability that the oflier n-1 sites remain uimiutated, (l-pXri-r), or 



where, P = the probabihty of attaining a given change r mutations away. More generally, 
for all r values: 



It is instructive to compare the chances of finding an advantage one mutation away 
with the chances three mutations away. This is because, given the triplet genetic code, any 



psoralen crosslink to MAf and damage the nucleic acid minimally. The ratio of 





given codon can only change into nine other codons in one mutation. Indeed, it turns out 



-12- 



wo 01/90414 



PCT/USOl/16210 



that no codon can actually change into nine other amino acid codes in one mutation. The 
maximmn number of amino acids that can be accessed in one mutation is seven amino 
acids and there are only eight codons of the sixty-four that can do this. Most codons have 
five or six out of nineteen other anuno acids within one mutation. To reach all nineteen 
5 amino acids that are different fix>m the starting one requires, in general, three mutations. 

These three mutations cannot be sequential since the two intervening ones will not, in 
general, be selectively advantageous. Therefore we need to use steps fliat are, at least, three 
mutations in size (r=3) to use all 20 amino acids. 

For a mutation rate of .0067, which is that reported for "error-prone PGR**, using a 
10 message of 300 nucleotides, which gives a short protein of 1 00 amino acids: 

=1.51x10"^ 

Therefore, one would expect to need a threshold of : 



^ =6.64x10^ 



1.51x10 



15 replications at that mutation rate to reasonably expect to reach the next amino acid that is 

advantageous. This is not the replication to use since the binomial expansion shows that 
over 1/3 of trials (actually about 1/e) would not contain the given sequence with selective 
advantage. 

A poissbn approximation for large n and smaU p for a given /i can be calculated so 
20 that we can compute the general term when n is, say, of the order 10 ^ and p is of the order 

10'^. The general term of the approximation is: 

He** 

An amplification factor of greater than approximately 6/P ensures that evolution 
will progress with the use of all amino acids. This is useful when the production of novel 
25 proteios precliides the use of "shufOing" of preexisting proteins. 
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Limits on Purification 

Given a reversible binding where B and C compete for A: 



(1) 



\c\-k ^ 
(2) 



The total concentrations can be expressed: 
(3) 

[CI=[C]+[AC] 
(4) 

Dividing (3) by (4) 
[C]r=[c]+[^C] 

And substituting (1) and (2) for [B] and [C]: 
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Rearranging: 




5 Canceling the [A]'s in the numerator and denominator: 

[bI _ [AB]{k,+[A]} 
[C]r [AC]{K+[A]} 

Finally rearranging: 

10 

[AB] ^ [BUkc-^lA]) 

[AC] [ClrQc^^lA]) 

(Enrichment Factor) 

The above factor is termed the '^Enrichment Factor^'. The ratio of the total 
components is multiplied by this factor to calcidate the ratio of the bound components, or 
the emichment of B over C. The maximum enrichment factor is kc^s, when the* [A] is 
1 5 significantly smaller fihan kc or Ag. 

The emichment is limited by the ratio of binding constants. To emich a scarce 
protein that is boimd 100 times as strongly as its competitors, the ratio of that protein to its 
competitors is increased by 1 million with 3 emichments. To enrich a protein that only 
binds twice as strongly than its competitors, 10 enrichment cycles would gain only an 
20 enrichment of -1000. 

The following Example illustrate various embodiments of the present invention and 
are not intended in any way to limit the invention. 
EXAMPLE 1 : PRODUCTION OF THE SATA 

One skilled in the art will understand that the SATA can be produced in a number of 
25 different ways. For example, in a preferred embodiment, three fi:agments (Fig 1) are 

purchased from a commercial source (i.e. Dhaimacon Research Inc., Boulder, CO). 
Modified bases and a fragment 3 with a pre-attached puromycin on its 3' end and a PO4 on 
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its 3' end are included, all of which are available commercially. Three fragments are used 
to facilitate manipulatibn of the fragment 2 in forming the monoadduct. 

Yeast tRNAAla or yeast tRNAPhe is used; however, sequences can be chosen 
widely from known tRNA's. Preferably, sequences with only a limited number of U's in 
the portion that corresponds to the fragment 2 are used. Using a sequence with only a few 
U*s is not necessary because psoralen preferentially binds 5TJA3' sequences (Thompson J. 
F, et al Biochemistiy 21:1363). However, there would be less doubly adducted product to 
purify out if such a sequence was used. 

The fragment 2 is used in a helical conformation to induce the psoralen to 
intercalate. Accordingly, a complementary strand is required. RNA or DNA is used, and a 
sequence, such as poly C to one or both ends, is added to facihtate separation and removal 
after monoadduct formation is accomplished. 

The fragment 2 and the cRNA are combined in buffered 50 mM NaCl solution. The 
Tm is measured by hyperchromicity changes. The two molecules are re-annealed and 
incubated for 1 hour with the selected psoralen at a temperature -lO^'C less than the Tm. 
The psoralen is selected based upon the sequence used. For instance, a relatively insoluble 
psoralen, such as 8 MOP, has a higher sequence stringency but may need to be replenished. 
A more soluble psoralen, such as AMT, has less stringency but will fill most sites. 
Preferably, HMT is used. If a fragment 2 is chosen that contains more non-target U's, a 
greater stringency is desired. Decreasing the temperature or increasing ionic strength by 
adding Mg^ is also used to increase the stringency. 

Following incubation, psoralen is irradiated at a wavelength greater than 
approximately 400 nm. The irradiation depends on the wavelength chosen and the psoralen 
used. For instance, approximately 419 nm 20-150 J/cm2 is preferably used for HMT. This 
process will result in an almost entirely fiiran sided monoadduct. 
Purification of Monoadduct 

The monoadduct is then purified by HPLC as described in Sastry et al, J. 
Photochem. Photobiol. B Biol. 14:65-79. The fact that firagment 2 is separate &om 
firagment 3 facilitates the purification step because, generally, purification of monoadducts 
>25 mer is difficult (Spiehnann et al. PNAS 89: 4514-4518). 
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Ligation of Fragment 2 and 3 

The fragment 2 is ligated to the fragment 3 using T4 RNA ligase. The puromycin on 

the 3' end acts as a protecting group this is done as per Romaaiuk and Uhlenbeck, Methods 

in Enzymology 100:52-59 (1983). Joining of fragment 2+3 to the 3' end of fragment 1 is 
5 done according to the methods described in Uhlenbeck, Biochemistry 24:2705-2712 

(1985). Fragment 2+3 is 5' phosphorylated by polynucleotide kinase and the two half 

molecules are annealed. 

In an altemative method, significant quantities of ftiran sided monoadducted U will 

be formed by hybridizing poly UA to itself and irradiating as above. The poly UA will 
10 then be enzymatically digested to yield fiiran sided U which will be protected and 

incorporated into a tRNA analog by nucleoside phosphoramidite methods. Ofher methods 

of forming the psoralen monoadducts include the methods described in Gamper et al., J. 

Mol. Biol. 197: 349 (1987); Gamper et al,, Photochem. Photobiol. 40:29, 1984; Sastry et al, 

J. Photochem. Photobiol. B Biol. 14:65-79; Spiehnann et al. PNAS 89:4514-4518; U.S. 
15 Patent Number 4,599,303. 

SATAs generated by the methods described above will read UAG (anticodon 

CXJA). Additionally, UAA or UGA will be also be used. In various embodiments, any 

message that has the stop codon that is selected as the 'linking codon" is used. 

EXAMPLE 2: PRODUCTION OF PSORALENATED FURAN SIDED 
20 MONOADDUCTS FROM CUAGAYCUGGAGG RNA FRAGMENTS 

UV Light Exposure OfRNA:DNA Hybrids 

Equal volumes of 3 ng/ml RNAicRNA hybrid segments and of 10 |ig/ml HMT both 

comprised of 50mM NaCl are transferred into a new 1.5 ml capped polypropylene 

microcentrifuge tube and incubated at 37°C for 30 ntiinutes in the dark. This was then 
25 transferred onto a new clean culture dish. This is positioned in a photochemical reactor 

(419 imGi peak Soudiem New England Ultaviolet Co.) at a distance of about 12.5 cm so that 

iiradiance is —6.5 mW/cm2 and irradiated for 60-120 minutes. 

Removal of Low Molecular Weight Protoproducts 

100)xl of chloroform-isoamyl alcohol (24:1) is pipetted and mixed by vortex.. 
30 Mixture is centrifuged for 5 minutes at 15000 xg in a microcCTttrifiige tube. The 
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chlorofonn-isoamyl alcohol layer is removed with a micropipet. The chloroform-isoamyl 
alcohol extraction is repeated once again. Clean KNA is precipitated out of the solution. 

Alcohol Precipitation 

Two volumes (-1000 ^1) ice cold absolute ethanol is added to the mixture. The 
tube is centrifuged for 15 minutes at 15,000xg in a microcentrifuge. The supernatant is 
decanted and discarded and the precipitated RNA is redissolved in lOOfxl DEPC treated 
water then re-exposed to the RNA+8-MOP. 

Isolation of the Psoralentated RNA Fragments Using HPLC 

All components, glassware and reagents are prepared so that they are RNAase free. 
The HPLC is set up with a Dionex DNA PA- 100 package column. The psoralenated 
RNArDNA hybrid is wanned to 4°C. The psoralenated RNA is applied to HPLC followed 
by olignucleotide analysis, as described in the following section entitled "Olignucleotide 
Analysis by HPLC." The collected fractions will represent: 

a) 5'CUAGA^CUGGAGG3*, where ^ is pseudouridine (SEQ ID NO: 1) 

b) Furan sided 5*CUPsoralenAGATCUGGAGG3' monoadducts (SEQ ID 
NO: 2) 

c) .5'XXXXXCCUCCAGAUCUAGXXXXX3' (SEQ ID NO: 3) 

d) 5'XXXXXCCUCCAGAUCUPsoralenAGXXXXX3'(SEQIDNO:4) 

The fractions are stored at 4°C in new, RNAase free snapped microcentriftige tubes 
and stored at -20^C if more than four weeks of storage is required. 

Identification of the UNA Fragments R^resented by Each Peak Fraction Collected by 
HPLC Using Polyacrylamide Gel Electrophoresis (PAGE) 

The electrophoresis imit is set up in a 4*^C refrigerator. A gel is selected with a 2 
nun spacer. Each 5 ]i\ of HPLC fraction is diluted to 10 |iil with Loading Buffer. 10 jil of 
each diluted fraction is loaded into appropriately labeled sample wells. The tracking dye is 
loaded in a separate lane and electrophoresis is run as described ia the following section 
entitled *Tolyacrylamide Gel Electrophoresis (PAGE) of Psoralenated RNA Fragments." 
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After the electrophoresis run is complete, the electrophoresis is stopped when the tracking 
dye has reached the edge of the gel. The apparatus is disassembled. The gel-glass panel 
unit is placed on the UV light box. UV ligjits are turned on. The RNA bands are 
identified. The bands will appear as denser shadows under UV lighting conditions. 

Extraction of the RNA From the Gel 

Each band is excised with a new sterile and RNAase free scalpel blade and 
transferred into a new 1.5 ml snap capped nucrocentrifuge tube. Each gel is crushed 
against the walls of the microcentrifuge tubes with the side of the scalpel blade. A new 
blade is used for each sample. 1.0 ml of 0.3M sodium acetate is added to each tube and 
eluted for at least 24 hours at 4°C. The eluate is transferred to a new 0.5 ml snap capped 
polypropylene microcentrifuge tube with a micropipet. A new RNAase free pipette tip is 
used for each tube and the RNA with ethanol is precipitated out. 

Ethanol Precipitation 

Two volumes of ice cold ethanol is added to each eluate then centrifuged at 15,000 
xg for 15 minutes in a microcentrifuge. The supematants are discharged and the 
precipitated RNA is re-dissolved in 100 |j1 of DEPC treated DI water. The RNA is stored 
in the microcentrifuge tubes at 4°C until needed. The tubes are stored at -20°C if storage is 
for more than two weeks. The following is the assumed order of rate of migration for each 
fragment in order from fastest to slowest: 

a) 5*CUAGAYCUGGAGG3' 

b) Furan sided 5'CUPsoralenAGA^CUGGAGG3' monoadducts, 

c) 5*XXXXXCCUCCAGAUCUAGXXXXX3' 

d) 5'XXXXXCCUCCAGAUCUPsoralenAGXXXXX3* 

The tubes are labeled containing the remainder of each fi'action with the presumed 
chemical sequence and stored at -20°C. 
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Ethanol Precipitation of RNA 

RNA oligonucleotide fragments are precipitated, and all glassware is cleaned to 
remove any traces of RNase as described in the following section entitled **Inactivation of 
RNases on Eqtupment, Supplies, and in Solutions." All solutions are stored in RNAase 
5 free glassware and introduction of nucleases is prevented. Absolute ethanol is stored at O^'C . 

until used. Micropipetors are used to add two volumes of ice cold ethanol to nucleic acids 
tiiat are to be precipitated in microcentrifuge tubes. Capped microcentrifrige tubes are 
placed into the microfuge and spun at 15,000 xg for 15 nmiutes. The supematant is 
discarded and precipitated RNA is re-dissolved in DEPC treated Dl-water. RNA is stored 
10 at 4°C in miax)centrifuge tubes until ready to use. 

Ligation of RNA Fragments 2 and 3 

All glassware is cleaned to remove any traces of RNase as described in the 
following section entitled *Tnactivation of RNases on Equipment, Supplies, and in 
Solutions." The following is added to a new 1.5 nal polypropylene snap capped 
15 microcentrifuge tube using a 100-1000 \xl pipet and a new sterile pipet tip is used for each 

solution: 



Fragment 2 (3.0nM) 125.0^1 

Fragment 3 (3.0nM) 125.0|il 

Reaction buffer 250.0^1 

20 RNA T4 ligase (9-12U/ml) 42fil 
Reaction Buffer 

RNase free Dl-water 90.00ml 

Tris-HCl (50mM) 0.79g 

MgC12 (lOmM) 0.20g 

25 DTT (5mM) 0.078g 

ATP (ImM) 0.55g 
pHto7.8withHCL 

RNase free Dt-water QS to 100.00ml 
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The mixtxire is gently mixed and the RNA is melted by incubating the mixture at 
16**C for one hour in a temperature controlled refrigerated chamber. KNA is precipitated 
out of the solution immediately after the incubation is completed. 

Alcohol Precipitation 

5 Two volumes ('--1000 of ice cold absolute ethanol are added to the reaction 

mixture. The microcentrifuge tube is placed in a microcentrifuge at 15,000 xg for 15 
minutes. The supemate is decanted and discarded and the precipitated RNA is re-dissolve 
in 100 jil DEPC treated water. The mixture is electrophoresed as described in the 
following section entitled *Tolyacrylamide Gel Electrophoresis (PAGE) of Psoralenated 
10 KNA Fragmmts." The following is the assumed order of rate of nodgration for each 
fragment in order from fastest to slowest: 

a) Frag. 2 
5'CUAGATCUGGAGG3'-OH 

Psoralen 

15 

b) Frag. 3 

5'UCCUGUGT^CGAUCCACAGAAUUCGCACC-Puromycin (SEQ ID 

NO: 5) 

20 c) Frag 2+3 

5»CUAGAYCUGGAGGUCCUGUGT^CGAUCCACAGAAUUCGCACCPuromy 
cm (SEQ ID NO: 6) 
Psoralen 



25 Each fraction is isolated by UV shadowing, the bands are cut out, the RNAs are 

eluted from the gels and the RNA elute is precipitated out as described in the following- 
section entitled *Tolyacrylamide Gel Electrophoresis (PAGE) of Psoralenated RNA 
Fragments." 

The ligation procedure is repeated with any residual imligated fragment 2 and 3 
30 fractions. 
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The ligated fractions 2 and 3 are pooled and stored in a small volume of KNase free 
Dl-water at 4°C, 

Ligation of RNA Fragment 1 with Fragment 2+3 

All glassware is cleaned to remove any traces of KNase as described in the 
5 following section entitled "Inactivation of RNases on Equipment, Supplies, and in 

Solutions." The following is added to a new 1,5 ml polypropylene snap capped 
' naicrocentrifuge tube. A 100-1000 ill pipet and new tip is used for each solution: 

Fragment 2+3 (3.0nM) 125.0jil 
Reaction buffer 250.0p.l 
10 T4 Polynucleotide Kinase(5-10U/ml) 1.7\xl 

Reaction RiifTer 



RNase free Dl-water PO.OOntil 

15 Tris-HCl (4QmM) 0.63g 

MgC12 (lOmM) 0.20g 

DTT (5mM) 0.08g 

ATP (ImM) 0.006g 
pH to 7.8 with HCL 
20 RNase free Dl-water QS to 100.00ml 



The RNA is gently nodxed then melted by heating the mixture to 70°C for 5 minutes 
in a heating block. The mixture is cooled to room temperature over a two hom: period and 
the RNAs is allowed to anneal in a tRNA configuration. The RNA is precipitated out of 
25 the solution. 

Alcohol Precipitation 

Two volumes (-1000 p.1) if ice cold absolute ethanol are added to the reaction 
mixture. The microcentrifuge tube is placed in a microcentrifuge at 15,000 xg for 15 
minutes. The supemate is decanted and discarded and the precipitated RNA is re-dissolved 
30 in 100 |il DEPC treated water. The mixture is electrophoresed as described in the 
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following section , entitled 'Tolyacrylamide Gel Electrophoresis (PAGE) of Psoralenated 
RNA Fragments." The following is the assmned order of rate of migration for each 
. firagment in order &om fastest to slowest: 

a) Frag. 1 

5'GCGGAUUUAGCUCAGDDGGGAGAGCGCCAGACU3* 

b) Frag 2+3 

S'CUAGAYCUGGAGGUCCUGUGWCGAUCCACAGAAUUCGCACCPuromy 

cin 

Psoralen 

c) Frag. 1+2+3 

5'GCGGAUUUAGCUCAGDDGGGAGAGCGCCAGACUCUAGA^CUGGAGG 
UC. . .CUGUGT^CGAUCCACAGAAUUCGCACCPuromycin (SEQ ID NO: 7) 
Psoralen 

Each firaction is isolated by UV shadowing, the bands are cut out, the RNAs are 
eluted from the gels and the RNA elute is precipitated out as described in the following 
section entitled 'Tolyacrylamide Gel Electrophoresis (PAGE) of Psoralenated RNA 
Fragments." The Ugation procedure is r^eated with the unhgated Fragment 1 and the 2+3 
Fraction. The ligated fractions 2 +3 are pooled and stored in a small volume of RNase free 
Dl-water at 4''C. 

Final RNA Ligation 

The following is added to a new 1.5ml polypropylene snap capped microcentrifuge 
tube. A 100-1000 p,l pipet and new tip is used for each solution: 

250^1 
250h1 
22ixg 



Fragment 1+2+3 (3.0nM) 

reaction buffer 

RNA T4 Ugase (44p.g/ml) 
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The mixture is incubated at IT^'C in a temperature controlled refrigerator for 4.7 
hours. Immediately after the incubation the tRNA is precipitated out as described in step 
6.2 above and the tRNA is isolated by electrophoresis as described in the following section 
entitled *Tolyacrylamide Gel Electrophoresis (PAGE) of Psoralenated RNA Fragments." 
5 The tRNA is pooled in a small volume of RNase free water and stored at 4^C for up to two 
weeks or stored at -20®C for periods longer than two weeks. 

Polyacrylamide Gel Electrophoresis (PAGE) of Psoralenated RNA Fragments 

Acrylamide Gel Preparation 

10 All reagents and glassware must be RNAase free as described in the following 

section entitled "Inactivation of RNases on Equipment, Supplies, and in Solutions." The 
gel apparatus is assembled to produce a 4 mm thick by 20 cm x 42 cm square gel. 29 parts 
acrylamide with 1 part ammonium crosslinker are mixed at room temperature with the 
appropriate amount of acrylamide solution in an RNAase free, thick walled Erlenmeyer 

15 flask. 

Acrylamide Solution 
urea (7M) 420.42 g 
TBE(IX) QStolL 

20 SXTBE 

0.455 M Tris-HCl .53.9g 

lOmM EDTA 20ml of 0.5 M 

RNAase free DI water 900ml 

pH with boric acid to pH 9 

25 QS with RNAase free DI water to IL 



The mixture is degassed with vacuum pressure for one minute. The appropriate 

amount of TEMED is added, mixed gently, and then the gel mixture is poured between the 

glass plates to within 0.5 cm of the top. The comb is immediately inserted between the 

30 . glass sheets and into the gel mixture. An RNAase free gel comb is used. The comb should 

produce wells for a 5 mm wide dye lane and 135 mm sample lanes. The gel is allowed to 
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polymerize for about 30-40 minutes then the comb is carefully removed. The sample wells 
are rinsed out with a running buffer using a micropipet with a new pipet tip. The wells are 
then filled with nmning buffer. 

Sample Preparation 

An aliquot of the sample is suspended in loading buffer in a snap capped 
microcentrifuge tube and vortex mixed. Indicator dye is not added to the sample. 

Loading Buffer 

Urea (7M) 420.42 g 

Tris HCl (50mM) 7.85 g 

QS with RNAase firee D-H20 to IL 



Electrophoresis run 

The maximum volume of RNA/loading buffer solution is loaded iuto the 135 Tnnn 
sample wells and the appropriate volume of tracking dye in 5 mm tracking lane. The 
samples are electrophoresed in a 5**C refrigerator. The electrophoresis is stopped when flie 
tracking dye has reached the edge of the gel. Disassemble the apparatus. Glass panels are 
not removed the firom the gel. The gel-glass panel unit is placed on a UV light box. With 
UV filtering goggles in place, the UV lights are tumed on. The RNA bands are identified. 
They appear as denser shadows mder UV lighting conditions. The RNA is extracted from 
the gel. Each bands is excised with a new sterile and RNAase firee scalpel blade and each 
band is transferred into a new 1.5 ml snap capped microcentrifuge tube. Each gel is 
crushed against the walls of the microcentrifuge tubes with the side of the scalpel blade. A 
new blade is used for each sample. 1.0 ml of 0.3M sodium acetate is added to each tube 
and eluted for at least 24 hours at 4°C. The eluate is transferred to a new 0.5 ml sn^ 
capped polypropylene microcentrifuge tubes with a micropipet with a new RNAase firee 
pipet tip for each tube. Two volumes of ice cold ethanol is added to each eluate, then 
centrifiiged at 15,000xG for 15 minutes in a microcentrifuge. The supematants are 
discarded and the precipitated RNA is redissolved in 100 (xl of DEPC treated DI water. 
The RNA is stored in the microcentrifuge tubes at 4°C until needed. 
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Olignncleotide Analysis by HPLC 

HPLC purification of the RNA oligonucleotides is best effected using anion 
exchange chromatography. Either the 2*-protected or 2'-deprotected forms can be 
5 chromatographed. The 2 -protected form offers the advantage of minimizing secondary 

structure effects and provides resistance to nucleases. If the RNA is fiilly deprotected, 
sterile conditions are required during purification. 
Deprotection of 2 -Orthoester protected RNA 

The tubes are centrifiiged at 15,000xg for 30 seconds or until the RNA pellet is at 
10 the bottom. 400 |il of pH 3.8 deprotection buffer is added to each tube of RNA. 

. Deprotection Btiffer 

Acetic acid (lOOmM) is adjusted to pH 3.8 with tetramethylethylenediamine 
(TEMED). The pellet is completely dissolved in the buffer by drawing in and out of a 
pipette. The tubes are vortexed for 10 seconds and centrifiiged at 15,000xg. The tubes are 
15 incubated in a 60 water bath for 30 minutes. The samples are lyophilized before use. 

HPLC Column Conditions 

A 4 X 250 mm column (DNAPAC PA, No. 043010) packed with Dionex (800)- 
DIONEX-0 (346-6390), with a capacity of 40 optical density units (ODU) at 260 nm is 
installed. The colimm temperature is set to 54 °C. The injection volume is adjusted such 
20 that 5 \il produces approximately 0.20 ODU 

Elution buffers 



Condition 


Buffer A 


Buffers 


Sodium perchlorate 


(5mM) 2.8 g 


(300mM)168.O g 


Tris-HCl 


2.4 g 


2.4 g 


Acetonitrile (2%) 


80.0 ml 


80.0 ml 


DI Water 


3900 ml 


900 ml 


Adjusted pH 


8.0 withHCL 


8.0 WithHCL 


q.s. 


4000 ml 


4000 ml 
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HPLC Gradient 

A 30% to 60% gradient of Buffer B for oligos 17-32 base pairs long is provided: 



Time 
(minutes) 


Flow 
(ml/min) 


%A 


%B 


Curve 


0 


1.5 


100 


0 


* 


1 


1.5 


100 


0 


6 


3 


1.5 


70* 


30* 


6 


15 


1.5 


40* 


60* 


6 


15.5 


2.5 


0 


100 


6 


17 


2.5 


0 


100 


6 


17.25 


2.5 


100 


0 


6 


23 


2.5 


100 


0 


6 


23.1 


1.5 


100 


0 


6 


24 


1.5 


100 


0 


6 


25 


0.1 


100 


0 


6 



* % values that can be changed to modify the gradient. Typical gradients are 0-30%, 20- 
50%, 30-60%, and 40-70% of Buffer B. 

Gradient selection 

The gradient is selected based upon the number of bases, as follows: 



Number of bases 


Gradient 


0-5 


0-30 


6-10 


10-40 


11-16 


20-50 


17-32 


30-60 


33-50 


40-70 


>50 


50-80 



After HPLC, the target samples are collected and the RNA concentration is 
determined with a spectrophotometer at 260 nm. The samples are stored at -70 °C, 

Inactivation of RNases on Equipment, Supplies, and in Solutions 

Glassware is treated by baking at 180**C for at least 8 hours. Plasticware is treated 
by rinsing with chloroform. Alternatively, all items are soaked in 0.1% DEPC. 

Treatment with 0.1% DEPC 

0.1% DEPC is prepared. DI water is filtered through a 0.2|iM membrane filter. 
The water is autoclaved at 15 psi for 15 minutes on a liquid cycle. l.Og (wt/v) DEPC/liter 
of sterile filtered water is added. 
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Glass and Plasticware 

All glass and plasticware is submerged in 0.1% DEPC for two hours at 37°C. The 
glassware is rinsed at least 5X with sterile DI water. The glassware is heated to lOO^C for 
IS minutes' or autoclaved for IS minutes at IS psi on a liquid cycle. 

S Electrophoresis Tanks Used for Electrophoresis of RNA 

Tanks are washed with detergent, rinsed with water then ethanol and air dried. The 
tank is filled with 3% (v/v) hydrogen peroxide (30ml/L) and left standing for 10 minutes at 
room temperature. The tank is rinsed at least 5 times with DEPC treated water. 

Solutions 

10 All solutions are made using Rnase firee glassware, plastic ware, autoclaved water, 

chemicals reserved for work with RNA and KNase firee spatulas. Disposable gloves are 
used. When possible, the solutions are treated with 0.1% DEPC for at least 12 hours at 
37*'C and then heated to 100°C for IS minutes or autoclaved for 15 minutes at 15 psi on a 
Hquid cycle. 

IS RNA Translation 

2 |xl of gastroinhibitory peptide (GIP) mRNA at a concentration of 20 p.l/ml is 
placed in a 250 \i\ snapcap polypropylene microcentrifuge tube. 35 |il of rabbit reticulocyte 
lysate (available commercially fi-om Promega) is added. 1 jil of amino acid mixture which 
does not contain methionine (available commercially firom Promega) is added. 1 (xl of 35S 
20 methione or unlabeled methionine is added. Optionally, 2 ml of ludferase may be added to 
some tubes to serve as a control. 

SATA is added to the experimental tubes. Control tubes which do not contain 
SATA are also prepared. The quantity of SATA used is approximately between O.lug to 
500 |Lig, preferably between 0.5 p.g to 50 ^g, 1 jxl of Rnasin at 40 units/ml is added. 
25 Nuclease firee water is added to make a total volume of 50 jxL 

For proteins greater than approximately 150 amino acids, the amount of tRNA 
may need to be supplemented. For example, approximately 10 - 200 iig of tRNA may be 
added. In general, the quantity of the SATA should be high enough to effectively suppress 
stop or pseudo stop codons. The quantity of the native tRNA must be high enough to out 
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compete the SATA which does not xmdergo dynamic proofreading under the action of 
elongation factors. 

Each tube is immediately capped, parafilmed and incubated for the translation 
reactions at 30**C for 90 minutes. The contents of each reaction tube is transferred into a 50 
5 |xl quartz c^illary tube by capillary action. The SATA is crosslinked with mENA by 

illuminating the contents of each tube with 2-10 J/cm2 —350 imi wavelength light, as per 
Gasparro et al. (Photochem. Photobiol. 57:1007 (1993)). Following photocrosslinking, the 
contents of each tube is transferred into a new snapcap microfuge tube. The ribosomes are 
dissociated by chelating the calcium cations by adding 2 |il of 10 mM EDTA to each tube. 
10 Between each step, each tube is gently mixed by stirring each component with a pipette tip 

upon addition. 

The optimal RNA for a translation is determined prior to performing definitive 
experiments. Serial dilutions may be required to find flie optimal concentration of mRNA 
between 5-20 |xg/mL 

15 



Reagent 


1 


2 


3 


4 


Rabbit reticulocyte lysate (35fxl) 






+ 


+ 


Amino acid mixture minus 
methionine (l|j,l of 1 mM) 


+ 


+ 
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^'S Methionine (Ijxl of 
l,200Ci/mmol) 








+ 


Methionine (unlabeled) 






+ 




GIP mRNA (2^1 of 20 fig/ml) 


+ 








'^P GIP mRNA (2^x1 of 20 p-g/ml) 




+ 


+ 




Rnasin (1^1 of 40 U/^il) 


+ 








SATA 










Water, nuclease firee (q.s. to 50 pi) 






+ 


4- 
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SDS-Page electrophoresis is performed on each sample, as described above. 
Autoradiognqphy on the gel is performed, as described by Sambrook et. al.. Molecular 
Cloning, A Laboratory Manual, 2^ ed., Coldspring Harbor Press (1989). 

5 

While a number of preferred embodiments of the current invention and variations 
thereof have been described in detail, other modifications and methods of use will be 
readily apparent to those of skill in the art. Accordingly, it should be imderstood that 
various appUcations, modifications and substitutions may be made without departing from 
10 the spirit of the invention or title scope of the claims. 
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WHAT IS C LATMFn TS: 

1 . A method for selecting a desired protein or nucleic acid molecule comprising: . 

providing at least two candidate mRNA molecules, wherein at least one of 
5 said mRNA molecules contains at least one codon selected fix>m the .groiq) 

consisting of a stop codon and a pseudo stop codon; 

translating at least two of said candidate mRNA molecules to generate at 
least one translated protein; 

linking at least one of said candidate mRNA molecules to its corresponding 
10 translated protein to fomi at least one cognate pair; 

selecting one or more of said cognate pairs based upon the properties of said 
translated protein or said mRNA molecule; and 

selecting said translated protein or said desired nucleic acid molecule 
comprising identifying a molecule selected from the group consisting of an mRNA 
15 molecule of said selected cognate pair, a nucleic acid molecule complementary to 

said mRNA molecule and a nucleic acid molecule homologous to said mRNA 
molecule. 

2. The method of Claim 1, wherein at least one of said candidate mRNA molecules 
20 and at least one said translated protein is linked by a tRNA molecule selected from the 

group consisting of tRNA, modified tRNA and tRNA analogs. 

3. The method of Claim 2, wherein said tElNA molecule is connected to said 
translated protein by ribosomal peptidyl transferase. 

25 

4. The method of Claim 2, wherein said tRNA molecule is connected to the 
mRNA through an ultraviolet-induced cross link between flie anticodon of said tRNA 
molecule and the codon of the RNA message. 

30 5. The method of Claim 2, wherein said tRNA molecule is connected to the 

mRNA by thiouracil. 
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6. The method of Claim 2, wherein said tRNA molecule is comiected to the 
mRNA by a psoralen cross link. 

7. The method of Claim 2, wherein said tElNA molecule is connected to the 
mRNA by using a psoral^ monoadduct. 

8. The method of Claim 7, wherein said psoralen monoadduct is placed on said 
mRNA. 

9. The method of Claim 7, wherein said psoralen monoadduct is placed on an 
anticodon of said tRNA molecule. 

10. The method of Claim 7, wherein said psoralen monoadduct is placed on a stop 
anticodon of said tRNA molecule. 

11- The method of Claim 7, wherein said psoralen monoadduct is generated by 
forming crosslinks and photo reversing said crosslinks or by using visible hght. 

12. The method of Claim 2, wherein said fRNA molecule has a stable peptide 
acceptor. 

13. The method of Claim 2, wherein said tRNA molecule is modified to accept a 
peptide chain and hold said chain in a stable mann^, such that peptidyl transferase caimot 
detach it. 

14. The method of Claim 2, wherein said tRNA molecule is modified by using a 
bond which binds to the ribosome, accepts the peptide chain, and then does not act as a 
donor in the next transpeptidation. 

15. The method of Claim 14, wherein said bond is selected from the group 
consisting of a 2* ester on a 3" deoxy adenosine and an amino acyl tEtNA^^^- 

16. The method of Claim 2, wherein an amino acid or amino acid analog is attached 
to the 3* end of said tRNA molecule by a stable bond to generate a stable aminoacyl tRNA 
analog. 
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17. The method of Claun 1, wherein said translating is performed in vitro. 

18. The method of Claim 1, wherein said translating is performed in situ. 

19. The method of Claim 1; wherein said cognate pair is selected based upon ligand 
binding. 

20. The method of Claim 1, wherein selecting said desired nucleic acid molecule 
comprises: 

having an array of nucleic acids, wherein said nucleic acids are placed in a 
predetermined position; 

hybridizing at least one of said cognate pairs onto said array; and 
identifying the mRNA based upon its reaction to a protein identifier. 

21 . The method of Claim 20, wherein said reaction to a protein identifier is selected 
firom the group consisting of ligand binding, immimoprecipitation and enzymatic reactions. 

22. A kit to generate cognate pairs comprising at least one psoralen monoadduct 
attached to a nohadducted stable aminoacyl tRNA analog or at least one psoralen 
monoadduct attached to an oligonucleotide. 

23. A method for evolving a desired protein sequence comprising: 

selecting at least one of said first cognate pairs based upon at least one 
desired characteristic of said translated first protein or said mRNA according to the 
method of Claim 1 ; 

recovering at least one of said first cognate pairs with said desired 
characteristic to generate at least one recovered cognate pair; 

amplifying a first nucleic acid component of one or more of said recovered 
cognate pairs; 

producing at least one second nucleic acid component comprising at least 
one of said first nucleic acid components with one or more variations; 
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producing at least one second protein by translating at least one of said 
second nucleic acid components; 

linking at least one of said second proteins with at least one of said second 
nucleic acid components to generate one or more second cognate pairs; and 
S obtaining the desired protein sequence by re-selecting one or more of said 

second cognate pairs based upon at least one desired property, wherein said desired 
property is the same or different than said desired characteristic. 

24. The method of Claim 23, wherein said desired characteristic is selected from the 
10 group consisting of binding properties, enzymatic reactions and chemical modifications. 

25. The method of Claim 23, wherein selecting at least one of said first cognate 
pairs comprises: 

providing a first ligand with a desired binding characteristic; 
15 contacting one or more of said first cognate pairs with said first ligand to 

generate imboimd complexes and bound complexes; 

recovering either the bound complexes or the unboimd complexes; 
amplifying at least one nucleic acid component of the recovered complexes; 
introducing variation to a sequence of said nucleic acid component of said 
20 recovered complexes; 

translating one or more second proteins from said nucleic acid components, 
linking at least one of said second proteins with at least one of said second 
nucleic acid components to generate one or more second cognate pairs; and 

obtaining the desired protein sequence by contacting said at least one of said 
25 second cognate pairs with at least one second ligand to select one or more of said 

second cognate pairs, wherein said second Ugand is the same or different than said 
first Ugand. 
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Figure 1 
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Figure 4 
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